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Abstract 

A four index notation {e.g. (1011)) is often used to denote reciprocal lattice vectors or crystal 
faces of hexagonal crystals. The purposes of this notation have never been fully explained. This 
note clarifies the underlying mathematics of a symmetric overcomplete basis. This simplifies and 
improves the usefulness of the notation. 
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I. INTRODUCTION 



— * — * — * — * 

A reciprocal lattice vector (RLV) has the form G = uaA + ubB + tiqC, where A, B, C 
are primitive translations of the reciprocal lattice. The integer "indices" {nA,nB,nc) were 
originally introduced as the Miller indices, which are reciprocals of the intersection points (in 
units of a = \a\, etc.), of a crystal lattice plane with the axes along the primitive translation 
vectors a, b,c of the direct lattice. The "4-index" notation (1011) in a hexagonal crystal 
denotes an RLV with {nA,nB,nc) = (1,0, 1). The extra, or third, index (1 = —1) in (1011) 
is redundant. This index must equal the negative sum of the first two indices. The notation 
restores symmetry between equivalent directions which is lost if the third index is omitted. 
This notation has long been used by crystallographers, dating back at least to Fedorov in 
1890^. An extra Miller index is natural in hexagonal symmetry, because, rather than two 
a6-plane axes a, b at 120°, the three symmetrical axes ai, 02, as, shown in Fig. 1, are natural. 
The plane of atoms intersects all three axes. The reciprocal intersection points (hki) are 
forced by trigonometry to obey the rule h + k + i = 0. The proo^ is hinted in Fig. 1. 

Even before Bragg scattering was observed and explained (1912-13), the mathematical 
concept of a dual or reciprocal lattice was used^. After 1912, physicists recognized the RLV 
as an x-ray momentum transfer. The "indices" which label planes seem secondary unless we 
are specifically studying an atomic plane. The redundant index may seem only a nuisance. 
Since the advantages of the 4-index notation are not always understood, it is common to 
omit the third index, as would be done in crystal systems which lack 120° rotations. This 
note is written in the belief that, once the underlying idea is clearly understood, the four 
index notation is natural. It can be used to some advantage to label RLV's (hkil) and also 
to label directions [stru] (called "zones" when the direction perceived as the common axis 
of a family of planes) in the direct lattice. However, the 4-index labeling of directions [stru 
that emerges in my analysis makes a subtle improvement over the one in use in electron 
microscopy Otte and Crocker- discuss notation carefully, but with a different aim. 

Here is the main idea. In an hexagonal crystal, define four unit vectors, with the fourth 
(64) pointing along the c-axis. The other three lie in the afe-plane, at 120° to each other, 
as shown in Fig{TJ It is conventional to have them point in the directions of primitive 
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translations of the lattice. Then an arbitrary 3-vector v is written with four components, as 
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If -y is a position vector R in "real" space, then it can be written as 
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where the indices rii are dimensionless, and squared brackets conform to the crystallographic 
convention for directions in coordinate space. If i? is a translation vector of the lattice, then 
the indices are integers. If -u is a vector of the reciprocal lattice Q, then it can be written 



Q = 27r 
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where again the indices are dimensionless, and rounded brackets conform to convention. If Q 
is a translation vector G of the reciprocal lattice, then the dimensionless indices are integers, 
and the shorthand G = (^i?t-2't-3^4) is common. In this paper, I will stick to a convention that 
vectors written in column form as in Eqs. (12][5i) . or in row form with components separated 
by commas, contain the full dimensioned components, v ■ Ci. Dimensionless abbreviations 
are never implied except in the index notations [nin2n^n^ and (?^i?t-2^3^4) without commas. 
The term "index" will always refer to a dimensionless version of a component of a vector. 

In Dirac notation, \v) indicates a column vector, while (f | indicates a row vector. However, 
in this paper notation switches between two and three dimensions and between 2-vector, 3- 
vector and 4-vector systems. The Dirac notation is mostly avoided to reduce ambiguity. 
When a vector is written alone, it can be assumed a column vector. When written with 
another vector in a dot product u ■ v, the left vector is a row vector and the right vector a 
column vector. When written as a dyad vv, the left vector is a column vector and the right 
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FIG. 1. (a) a6-plane of a hexagonal crystal, showing a = ai and 6 = 02 axes, the third axis (03). In 
red is shown the line of intersection of a plane of atoms with the ab plane. This red plane intersects 
the ai axis at ri, the 02 axis at r2, and the 03 axis at —r^. It is always possible to choose the origin 
such that the intersection is at ri = 1. Then the 120° geometry guarantees that l/r3 = l/r2 + 1. 
(b) Plaquette of 2-d hexagonal crystal, showing primitive translations a, b of the direct lattice and 
A, B of the reciprocal lattice. 



vector a row vector. When written in a line of text with commas, the vector (a, 6, c, d) is 
probably a column vector written sideways to save space. 

The 4-vector "basis" is overcomplete. It has both conventional and unconventional fea- 
tures. The basis vectors are, in 4-component notation 
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Note that the sum of the first three indices is always zero, which follows from the fact that 
the first three unit vectors add to zero, ei + 62 + 63 = 0. It is perhaps disconcerting that the 
unit vectors 61,62,63 do not "look like" unit vectors in the 4-vector notation, but this is a 
consequence of overcompleteness. For example, the second component of the 4-vector that 
represents ci, is, according to Eq.([T]), ei ■ 62 = cos 120° = —1/2. 

The notation makes symmetry explicit. The six translations have their first three indices 
chosen by the rule, organize three integers chosen from ±1 and ±2, in all ways such that 
they add to zero. Because of the symmetrical mathematics, the scalar product {e.g. u ■ v = 



UxVx + UyVy + UzVz) in 4- vector form is simple but unconventional: 
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u-v ^ -{uiVi + U2V2 + U3V3) + UiVi. (5) 
This can easily be verified, and will be explained in Sec. III. 

II. CONVENTIONAL ("BI- ORTHOGONAL") BASIS 

When one drops the redundant third index from the 4-index notation, the remaining in- 
dices express vectors in a non-orthogonal basis. There is perfectly good mathematics behind 
this, but it hides symmetry and simplicity. In a hexagonal system, these disadvantages are 
removed by the 4-index system. In systems of lower symmetry, where translation vectors 
are not at 90° and 120° to each other, such simphfication is not available. In this section, 
the mathematics of non-orthogonal basis sets is reviewed. 

To simplify, the c-direction is now omitted. The discussion thus refers to crystallography 
of hexagonal crystals in two dimensions. The third dimension returns in Sec. IV. In the 2-d 
space of the ah plane, any two vectors that are not parallel or antiparallel can be chosen for 
a basis. The most obvious choices are the primitive translations a and 6, or alternately, the 

— * — * — * — * 

primitive translations A and B of the reciprocal lattice, defined as A — {2TT/V)b x c, and 
similarly for B. V is the volume of the three-dimensional unit cell, V — (-\/3/2)a^c. Then 
we have the usual vector relations A ■ a — 2% — B ■ b and A ■ b — — B ■ a. These relations 
indicate that the basis sets (a, b) and (A, B) are bi-othogonal. 

It not always mentioned in texts that the primitive direct lattice vectors and the primitive 
RLV's are examples of the mathematical notion of bi-orthogonal basis vectors. The impor- 
tant property is completeness: any vector can be expressed as a unique linear combination 
V — Vaa + Vf,b. Note that v-a — Vaa'^ + Vf,a-b, where a • 6 equals — a^/2 in hexagonal systems. 

— * 

The coefficients Va and Vf, are found by solving a 2 x 2 linear system in a and v-b. A nice 
aspect of bi-orthogonality is that it diagonalizes the 2x2 system and gives simple formulas 
for the coefficients, namely Va = A ■ v/27r, Vi, = B ■ v/27r. 

It is arbitrary which basis (direct space or reciprocal) is taken to be primary and which 

— * — * 

to be dual. Thus, an arbitrary vector v has an alternate representation, v — v^A + vbB. 

— * 

The coefficients are va — a ■ v/2n, vb — b ■ v/2n. The inner product u • v of two vectors 
is not given simply by + v^, but involves also the cross term VaVf,- The simple formula 
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is u ■ V = 27i{uAVa + UB^b), where the row vector is expressed in the basis dual to the one 
chosen for the column vector. An equivalent formula is u - v = 27r{uaVA + UbVB)- A compact 
mathematical representation of these relations is given by the equation \ A) (a| + |-B) (6| = 27rl, 
or by the alternate equation + \b){B\ = 27rl. Here the notation \v) means a 2- vector, 

and 1 is the 2x2 unit matrix. If written as a 2-component column vector, the basis should 
either be orthonormal, or if non-orthonormal, one has to be careful to use the direct and 
dual basis for the column vector \v) and the row vector (f |. In dyad notation, the relations 
are 

Aa + Bb = aA + bB = 27t1. (6) 

This formula is called "the completeness relation" or, equivalently, "the decomposition of 
unity." Although this gives elegant formulas for inner products in non-orthogonal basis sets, 
these formulas are not likely to be used unless the vectors u and v belong separately, one 
to direct, and the other to reciprocal space. Then the formulas are obvious. One has no 
trouble realizing that Q ■ r = {QaA + QbB) ■ (r^a + rbb) is equal to 27r((5yi^a + Qsrh)- 



III. OVERCOMPLETE SYMMETRICAL BASIS 



For two dimensional vectors, or the aft-plane components of 3-d vectors of an hexagonal 
crystal, the overcomplete symmetrical basis is the three vectors 61,62,63 which lie at 120° 
to each other, as shown in Fig{Tl The key relationship is the dyad formula 

2 

-(6161 + 6262 + 6363) = !. (7) 

This decomposition of unity is a very nice alternative to Eq.(j6]). It is no longer necessary 
to have dual sets of direct and reciprocal lattice vectors. The three vectors are self-dual. 
The scalar product of any two 2-vectors can be written as 
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U-V = U- l- V=-{u- 6161 ■ V + U ■ 6262 ■ V + U ■ 6363 ■ v). (8) 
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This formula can be written in an overcomplete 3-vector notation as 
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where the usual array multiphcation rule is obeyed. Unlike the case of the biorthogonal 
basis, here there is no need for care about whether -u or w is in direct or reciprocal space, or 
whether the primary or the dual basis is implied. The formula works for any two vectors. 

It is worth emphasizing one subtlety. In the bi-orthogonal basis, when a column vector 
has indices [vaVb\ , this means that v = VaS + Vbb, even though Va ^ v ■ a. Similarly, if the 
indices are written (vaVb), this indicates a vector v = vaA + vbB. Very different relations 
hold true in the symmetric overcomplete basis. If the vector has components fi,f2,f3, this 
means that vi = v ■ ei, etc., even though v ^ fiCi + ^262 + ^363. The actual answer is 
V = (2/3)(fiei + V2e2 + ^363). This follows immediately from Eq.(I7]). It does not matter 
whether the vector v is in direct space and indexed as [f if2^^3] or in reciprocal space and 
indexed as (f if 2^3). 

Now let us express the reciprocal lattice vectors in the overcomplete symmetric notation. 
First, recall that since A-a = {y/3/2)\A\a = 27r, we have \A\ = {2/y/3){27i/a). Then we find 
A - ei = {\/3/2)\A\ = 27[/a, A - 62 = 0, and A ■ = —271 /a. Therefore we have 
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The set of six symmetry-related smallest RLV's {hki) are simply all permutations of the 
three indices 1, 0, —1. The general reciprocal lattice vector is written in various ways, as 



G = uaA + ubB {nAns), 



(11) 
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{haUb Ua + Ub). (12) 



It is amazing how similar the bi-orthogonal representation (Eq JTT]) is to the overcomplete 
symmetric representation (Eq JT^ for RLV's. They involve different coordinate systems and 
rules, yet the former derives from the latter by just dropping the extra third index, and the 
latter from the former by adding an index which is the negative sum of the first two indices. 

The additional advantages of the extra index representation are that it is completely 
explicit (if 27r/a is included, the vector is completely specified), and there is no ambiguity 
about the direct versus dual parts of the bi-orthogonal representation. 



IV. CONCLUSIONS 



It is best to think of Eq.([T]) as a way of representing the vector v but not to think of 
the components of the 4- vector as if they had meaning similar to the ordinary 3-component 
notation. In ordinary vector notation, if the vector v is denoted [faffefc], then it has the 
formula VaS+Vhb + VcC. Or, if it is denoted by (vaVbVc), then it has the formula vaA + vbB + 
VqC. In 4-index notation, vectors indexed as (^1712^3714) or [nin2n3n4\ have the formulas 
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Note the unconventional factor 2/3. When written as an additive sum of primitive vectors 
proportional to Cj, an arbitrary additive constant can be added to Mi, M2, M3 or fi, f2, ^3 with 
no algebraic or notational error. For example, 

/ {vi + C)a \ 

^ 2a. 



■[{vi + C)ei + {V2 + C)e2 + {v^ + 0)63] + v^ce^ 7^ 
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When written in overcomplete 4-vector notation, Vi,V2,V3 are fixed numbers, necessarily 
adding to zero, and therefore with no arbitrariness. The constant C cannot be added, even 
though the vector relation C(ei + 62 + 63) = is true. 

The inner product of two vectors, in 4-vector notation, is 

[2/3 \ f vi\ 
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This is just an alternative way of writing Eq.(|5]), that emphasizes the unconventional metric. 
A factor 2/3 occurs in the first three diagonal entries. The metric is positive, so the inner 
product is safely defined. Eq. (fTB]) holds for all vectors provided the indices Ui and Vi are 
written in full rather than abbreviated index form. For the special case where one of u, v is 
in direct and the other in reciprocal space, the inner product, modulo 2tt, also has this form, 
i.e. (hkil) ■ [stru] = 27r[(2/3)(/is + kt + ir) + lu], in the index form. If both are direct or 
reciprocal space vectors, an extra factor {c/aY or (a/c)^ appears in the fourth term. 

The unconventional metric with the factor 2/3 does not appear in previous literature. 
If the 4-index notation is confined to indexes of planes and reciprocal lattice vectors, then 
the current definition Eq.([T]) agrees with the one in the literature. It is less common for 
a 4-index notation to be used for coordinate space directions. Inner products are seldom 
discussed, and the factor 2/3 doesn't arise. 

For real space directions directions, the current definition Eq. ([1]) differs from used in the 
literature"^'^. Consider for example the lattice point R = nad+ribh+ricC. Using Eqs. lfTpj) . this 
is indexed as [{ua + niy/2) {n^ + fiajl) + nh/2) nj. The literature definition is different. 
It requires first writing R = naai + nhd2 + ncC— (1/3) {ua + nf,) {0,1 + 0,2 + ^3)). The extra term 
is zero but is added to make the sums of the coefficients of oi, 02, and 03 add to zero. By 
the literature definition, this vector is indexed as [stru] where the indices are the coefficients 
of di, etc., and s + t + r = 0. Thus we get R — )■ [2{na + nb)/3 {2nb + na)/3 {ria + nfe/3) rZc]. 
The first three indices have been reduced by 2/3. It never seems to be mentioned that 
the definition was changed between direct and reciprocal space. The factor 2/3 is now 
incorporated into the definition of the first three indices str of [stru] of direct space vectors, 
but not into the first three indices (khil) of reciprocal space vectors. Then the inner product 
of a direct space vector [stru] and a reciprocal space vector (hkil) is 2Ti{sh + tk + ri + ul), 
without the extra 2/3. The penalty is that inner products of two direct space vectors or 
two reciprocal space vectors must be computed with awkwardly different rules, whereas the 
unified definition offered here gives a simple unified (but unconventional) rule. In order to 
retain the usual definitions, one could accept as a compromise the ad hoc rule that whenever 
dimensionless real space indices [stru] are written, they incorporate an extra factor 2/3 
beyond Eq.(Il]), in the first three indices. However, when written out in full column vector 
notation, there is no need for index notation, and such a compromise would be unwise. 
Eq.(IT]) should be used, and the inner product rule Eq.(l5]) applies. 
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Overcomplete basis sets are not abnormal in physics. They are used frequently for infinite- 
dimensional problems. A quantum harmonic oscillator, for example, has an infinite complete 
orthonormal basis of eigenstates |n), but the "coherent state" representation, which is over- 
complete, is often preferable^. In finite-dimensional vector problems, hexagonal crystallogra- 
phy is not unique; symmetry and orthonormality may compete and suggest an overcomplete 
symmetric representation. An example is electronic (i-states, where cubic symmetry lifts 
5-fold degeneracy into a 3-fold degenerate T2g manifold (spanned by orthonormal functions 
yz, zx, and xy) and the 2-fold degenerate Eg manifold (spanned by orthogonal functions 

— y"^ and Sz"^ — r^.) The orthogonal basis is nicely adapted to the 4-fold rotations of 
cubic symmetry, whereas these same rotations mix the orthogonal functions in an ugly 
way. A cure^ is an overcomplete non-orthogonal basis such as 3x^ — , 3?/^ — r ^ , and Zz^ — r^ . 
The mathematics of this representation is exactly the same as the symmetric overcomplete 
representation described here for hexagonal crystals. 
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